Why the Normal Distribution?
نویسنده
چکیده
This short note explains in simple terms why the normal distribution is so ubiquitous in pattern recognition applications. As we will see, the answer comes from information theory: when we compress a data set by keeping only the mean and variance of the problem’s classes, the distribution which allows us to keep on working probabilistically, while making minimal assumptions about the data, is, precisely, the normal distribution. Why the normal distribution? The geek store offers a t-shirt with the legend “let < 0 . . .”. That is the complete punch line. Similarly, in pattern recognition applications people would raise their heads if someone said “let us modell the data clusters, about which we know almost nothing, by using an xy distribution”, where “xy” is not the word “normal”. In other words: you better have good reasons to propose modelling data clusters with anything else than a Gaussian, or at least a mixture of Gaussians. But then, why is the normal distribution so prevalent in pattern recognition applications? The classical book by Duda & Hart, for example, starts right with the Bayes rule and the multivariate normal distribution [1]. The Gaussian (normal) distribution, is used in many pattern recognition problems as an easy way of modelling the probability density of experimental data.
منابع مشابه
Variables Characteristics Effects on Static and Pseudo-Static Reliability-Based Design of near Slope Shallow Foundations
Given the concept of reliability-based design (RBD) and the growing risk management trend in geotechnical engineering, proper understanding and quantification of uncertainties are very important. The complexity of methods and a large volume of calculations of probabilistic design methods are the most critical reasons for civil engineers not to be comfortable using these approaches. In this rese...
متن کاملDeveloping Non-linear Dynamic Model to Estimate Value at Risk, Considering the Effects of Asymmetric News: Evidence from Tehran Stock Exchange
Empirical studies show that there is stronger dependency between large losses than large profit in financial market, which undermine the performance of using symmetric distribution for modeling these asymmetric. That is why the assuming normal joint distribution of returns is not suitable because of considering the linier dependence, and can be lead to inappropriate estimate of VaR. Copula theo...
متن کاملLinear regression, the normal distribution of error values or normal distribution of the dependent variable?
This article has no abstract.
متن کاملGeneralizations of the Skew t-Normal Distribution and their Properties
In this paper we consider several generalizations of the skew t-normal distribution, and some of their properties. Also, we represent several theorems for constructing each generalized skew t-normal distribution. Next, we illustrate the application of the proposed distribution studying the ratio of two heavy metals, Nickel and Vanadium, associated with crude oil in Shadgan wetland in the south-...
متن کاملA Flexible Skew-Generalized Normal Distribution
In this paper, we consider a flexible skew-generalized normal distribution. This distribution is denoted by $FSGN(/lambda _1, /lambda _2 /theta)$. It contains the normal, skew-normal (Azzalini, 1985), skew generalized normal (Arellano-Valle et al., 2004) and skew flexible-normal (Gomez et al., 2011) distributions as special cases. Some important properties of this distribution are establi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010